Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition
نویسندگان
چکیده
interaction of Linear Discriminant Analyand a modeling approach using continuous mixture density HMMs is studied experimentally. The largest improvements in speech recognition accuracy could be obtained when the classes for the LDA transform were defined to be sub-phone units. On a 12,000-word German recognition task with small overlap between training and test vocabulary a reduction in error rate by one fifth was achieved compared to the case without LDA. On the development set of the DARPA RM1 task the error rate wqs reduced by one third. For the DARPA speaker-dependent nogrammar case, the error rate averaged over 12 speakers was 9.9%. This was achieved with a recognizer employing LDA and a set of only 47 Viterbi-trained contextindependent phonemes.
منابع مشابه
The RWTH large vocabulary continuous speech recognition system
In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...
متن کاملRobust speech/non-speech detection using LDA applied to MFCC for continuous speech recognition
Continuous speech recognition applications need precise detection because the number of words to recognize is unknown and vocabulary words can be short. The speech/non-speech detection must be robust to the boundary precision. In this work, a new approach to evaluate detection algorithm for continuous speech recognition is presented. The speech/non-speech detection using energy parameter combin...
متن کاملClass definition in discriminant feature analysis
The aim of discriminant feature analysis techniques in the signal processing of speech recognition systems is to find a feature vector transformation which maps a high dimensional input vector onto a low dimensional vector while retaining a maximum amount of information in the feature vector to discriminate between predefined classes. This paper points out the significance of the definition of ...
متن کاملThe Rwth Speech Recognition System and Spoken Document Retrieval
In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...
متن کاملLanguage Identification for Under-Resourced Languages in the Basque Context
Automatic Speech Recognition (ASR) is a broad research area that absorbs many efforts from the research community. The interest on Multilingual Systems arouses in the Basque Country because there are three official languages (Basque, Spanish, and French), and there is much linguistic interaction among them, even if Basque has very different roots than the other two languages. The development of...
متن کامل